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Abstract. The article considers the method of factor cluster analysis which allows automatically retrain the on- 
board recognition system of an unmanned aerial system. The task of informational synthesis of an on-board system for 
identifying frames is solved within the information-extreme intellectual technology of data analysis, based on maxi- 
mizing the informational ability of the system during machine learning. Based on the functional approach to modeling 
cognitive processes inherent to humans during forming and making classification decisions, it was proposed 
a categorical model in the form of a direct graph. According to this model, the algorithmic support of the information- 
extreme factor cluster analysis is developed. It allows automatically retrain the system when expanding the alphabet of 
recognition classes. According to this algorithm, the on-board recognition system preliminarily carries out the 
information-extremal machine learning of recognition classes of relatively low power. When new classes appear, their 
unclassified structured recognition attribute vectors form additional learning matrixes. After reaching a representational 
volume, additional learning matrix joins the input learning matrix and the on-board recognition system is retrained. 
Forming additional learning matrixes of new recognition classes is carried out by the agglomerative algorithm of clus- 
ter analysis of unclassified vectors by k-means clustering. As a criterion of optimizing machine-learning parameters, 
we used the modified Kullback criterion which is a functional of the exact characteristics of classification solutions. To 
increase the functional efficiency of factor cluster analysis, it is proposed to increase the depth of machine learning by 
optimizing the parameters of image processing frames. 
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Anotauin. Po3rmaqaeTbca MeTOA (akTOpHOTO KacTep-aHalli3y, AKHM O3BOIAe aBTOMATHYHO TepeHaByaTH 
OopToBy CHCTeMy PO3ITi3HaBaHHA Oe3iWIOTHOTO aBlalliiHOro KOMIMIeKcy MIpH iWeHTH*iKalil KagpiB, CPopMoBaHOTO 10 
ONTHYHOMY KaHasly WAd@poBoro 300paxKeHHaA periony. 3aqaua iHtopMaliiHoro CHHTe3y OOpToOBO! CHCTeMH AVIA ijeH- 
THOikalli KalpiB po3B’A3yeTBCA B paMkax IH*OpMaLliMHO-eKCTpeMaJIbHO! iHTeeKTyaJIbHO! TeEXHOOMli aHai3y aHHx, 
aka Oa3yeTbCd Ha MaKCHMI3allll IHPOPMAaLMHO! CIPOMOXKHOCTI CHCTeMH B NIPOLeci MaluMHHOTO HaByaHHA. Y paMkax 
(PYHKWIOHAIbHOTO MiAxXoAy WO MOseOBaHHA KOTHITHBHHX TpOuUeciB, IPHTaMaHHUXx JOAHHI pH PopMyBaHHi Ta 
MIpHWHATTI KNacu*ikaliMHHXx PileHb, 3alpOMOHOBaHO KaTeropiliHy MOJeJIb y BUTIAL OpicHTOBaHOro Tpady. 3riqHo 3 
KaTeropiHOIO MOJeIIKO po3poOseHO alITOpHTMigHe 3a0e3lledeHHA iH(OpMalliifHO-eKCTpeMasIbHOrO (bakKTOpHOrO 
KlacTep-aHasli3y, WO TO3BOJIA€ ABTOMATHYHO TlepeHaByaTH OopToBy CHCTeMy pO3Ili3HaBaHHA TPH posliMpeHHi asida- 
BITY KaciB pO3Ii3HaBaHHA. 3a UMM aJIrOpHTMOM OopToBOi CHCTeMH pO3IIi3HaBaHHA TOMepeHbO 3/lCHIOE 
iH(OpMalllMHO-eKCTpeMaJIbHe MallMHHe HaBYaHHA 3a ai:aBITOM KAaCiB PO3Ii3HAaBaHHA BIHOCHO MaJIOi MOTY2KHOCTI. 
IIpu osBi HOBHX KaciB po3mi3HaBaHHA 1X HekKacHikoBaHi CTPpyKTypOBaHi BEKTOPH O3HaK PO3Ili3HaBaHHA YTBOPIO- 
IOTb JOMaTKOBI HaBuasIbHi MaTpuul. JloqaTKoBa HaByasIbHa MaTPUUA, 1Ka OcAra€ pelipeseHTaTHBHOTO OOcaAry, UpHEL- 
HY€TBCA JO BXIQHOi HaBYaJIbHOI MaTPHLi Ta 3AiMCHIOETBCA NepeHaByaHHA OopToBoi CHCTeMM posmi3zHaBaHHsa. DopMy- 
BaHHA JONaTKOBUX HaBYasIbHHX MaTPHUb HOBHX KJIACiB PO3Ii3HaBaHHA 3IMCHIOETECA 3a arIOMepaTHBHMM aJIropuT- 
MOM KlacTep-aHalli3y HeKjIacH PiIKOBaHHX BeKTOpiB 3a MeTOOM k-cepemHIx. Ak KpHTepili ONTHMI3al[il NapaMeTpiB Ma- 
UIMHHOTO HaByaHHaA OyO BUKOpHcTaHo MoyupikoBaHHu KpuTepiti KynpOaka, «kui € (PYHKUIOHAIOM Bi, TOUHICHUX 
XapaKkTepHcTHK KulacudikaliMHux pillenb. Jia MABMO,eHHA (PYHKWIOHAIbHOi ec:beKTHBHOCTI (akTOpHOrO KacTep- 
aHalli3y MpOnNOHyeTbcaA 30iIbLIMTH rIHOMHY MalMHHOrO HaBYaHHA WWIAXOM OMTHMI3alii MapamMeTpis oOpobseHHA 
300paxKeHb KaJIpis. 

Ksro4o0Bi cs10Ba: iH@opMalliiHo-eKcTpeMaIbHe MallIMHHe HaB4aHHA; ieHTu*iKallia; Kap WudpoBoro 
300paxKeHHA periony. 
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Introduction 

Unmanned aerial systems are widely used 
to search ground vehicles. Therefore, the urgent 
task is to provide the onboard recognition sys- 
tem (ORS) with independence based on ma- 
chine learning and pattern recognition [1]. The 
main approach to detecting ground ob-jects is 
the use of descriptive methods. Their main 
disadvantage is the uncertainty during 
recognizing objects of the same sizes. The use 
of neural-like structures for the analysis of 
digital images of ground objects also does not 
solve the problem of increasing the functional 
efficiency of the ORS due to their sensitivity to 
the multi-dimensional dictionary of recognition 
signs. As one of the promising directions of in- 
creasing the ORS functional effectiveness is ap- 
plying ideas and methods of the so-called 
information-extreme intellectual technology 
(IEI technology) of data analysis [8]. Its me- 
thods simulate the cognitive processes inherent 
to humans when forming and making classifica- 
tion decisions in the most appropriate manner. 
[23 

The article is devoted to the information- 
extreme method of ORS machine learning to 
identify the digital image frames of the observa- 
tion region, which operates in the mode of fac- 
tor cluster analysis (FCA) [7]. 

Research objective 

Let us consider a formalized statement of 
the problem of informational synthesis of an on- 
board frame identification system that operates 
in the FCA mode. There is formed an alphabet 
{X7,|m = 1, M} of recognition classes charac- 
terizing the image frames of the area. For each 
recognition class a three-dimensional learning 


matrix || ra Al of pixel brightness of the frame 
receptive field is formed, where the row 
fyi = 1, N}, N — number of recognition 
features, is as structured feature vector of the 
corresponding recognition class (hereinafter, 


simply implementation), and the matrix column 


— random training set {yJ : 


j= 1, n} with 
volume n. 
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It is known that the concept of IEI 
tech-nology involves transforming the input 
learning matrix Y into a working binary 
matrix X, which changes during machine 
learning. Therefore, for the Hamming binary 
space, a vector of functi-oning parameters is 
set. They affect the functi-onal efficiency of 
ORS machine learning in 
recognizing recognition class 
implementations X77: 


Im =<Xm Im, 6 >, 

() 
where xm — implementation ensemble- 
averaged vector, its vertex defines the center 
of the hyperspherical container of the recog- 
nition class X77; d,, — radius of hperspherical 
container recognition class X7,, which is re- 
stored in the radial basis of the recognition 
feature space; 5 — a parameter the of which 
is equal to half the symmetric field of the 
control tolerances divided by recognition 
signs which are the brightness values in pi- 
xels. The following restriction are imposed 
on the parameters of system functioning, 
further called the parameters of machine 
learning: 

— the range of pixel brightness values is in 
the range [0; 255] of tone gradation; 

— recognition radius container range *» is 
given by inequality dj, < d(xm ® x<), 
where d(x» ©®%X,-) — center-to-center 
distance between the implementation *m 
and the nearest implementation x, of the 
next class X?; 

— parameter range space 6 is given by ine- 
quality 6 < 6,,/2, qe 64 — standard tole- 
rance limit for recognition features. 

It is necessary to optimize the vector 
parameters (1) at the stage of ORS machine 
learning, which provide the maximum value 
of the information optimization criterion in 
the working (accessible) area of determining 
its function: 
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where E& )— the value of information crite-rion 
for optimizing learning parameters of the 
system to recognize class implementa-tions X7, 
calculated on the k learning step; G,- 
workspace for calculating an informa-tion 
criterion; {k}— number of learning steps. In the 
exam mode basing on_ unclassified im- 
plementations of recognition classes which 
characterize new frames of the image it is 
necessary to form additional learning mat-rixes. 
After reaching the representational vo-lume 
they join the input learning matrix and, in such 
a way, the ORS is retrained. 

Computational model 

In terms of functional approach, the cate- 
gorical model of information-extremal machine 
learning is built in the form of a directed graph 
[4]. Thus, the input mathematical formulation 


GxTxZ oe are 


_ 


yisenl 


of the category model is presented in the 
form of a following structure 

AB =<G,T,Z, 2, Y,X; F, ®, € >, 
where G — factors influencing the ORS; T — 
time points for receiving information; Z— 
recog-nition features dictionary; Z — space of 
system states, (alphabet of recognition 
classes); — input multidimensional learning 
matrix; X— working binary learning matrix; 
F:GxXTXZ-—X2-— operator for forming 
recognition features dictionary; ®—operator 
for forming the input learning matrix 
Y; 0,: Y — X— operator for transforming Y 
matrix into a working binary matrix X; é— 
operator for expanding the matrix Y. 

Figure | shows categorical model of 
informational and extreme machine learning 
for ORS in the FCA mode. 


Fig. 1. FCA categorical model 


In the circuit that simulates the operation 
of ORS in machine learning mode, the operator 
62 reflects the implementations of the working 
matrix into a fuzzy partition R!™! of recognition 
classes. Classification operator WY: R!M! = JI! 
checks the basic statistical hypothesis about the 
GV) 


mn tO Class 


belonging the implementation x 
Xmn and generates hypotheses I where [- 


number of statistical hypotheses. Basing on the 
results of statistical hypotheses operator 
y:t!4 7/41 forms a number of accuracy 
characteristics I!7!, where q = 1%. Operator 
:1'9| + E calculates the set of values of the 
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information criterion for optimizing 
machine-learning parameters. At each step 
of machine learning the operator in the radial 
basis of the feature space restores 
recognition containers that form a partition 
R'"|. The circuit for op-timizing acceptance 
tolerance for recognition features is closed 
due to term set D — accep-tance tolerance 
framework. 

In the circuit that simulates the opera- 
tion of ORS in the exam mode, or directly in 
the operating mode, the classification ope- 
rator of the examination recognition vector 
forms a composition, Ye = ',oW";, 
where the operator ¥’ calculates member- 
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ship functions and forms a term set F’, while the 
operator ¥” ; calculates decision rules. 

According to the exam results, a set of 
hypotheses /!”+1I is formed. Among them the 
hypothesis y,,,,means that the examination 
implementation does not belong to the alphabet 
of recognition classes {X7,|m = 1, M}. An 
operator &€ basing on _ unclassified 
implementations generates an additional 
learning matrix, and, after reaching the 
minimum representative volume, adds to the 
input learning matrix Y and starts the process of 
ORS retraining. Operators U, and Ug, regulate 
machine learning and exam _ processes 
accordingly. 

The categorical model (Fig. 1), built as 
part of a functional approach to modeling cog- 
nitive processes for the formation of classifica- 
tion decisions, is a generalized scheme of the 
information-extreme FCA algorithm. 

Implementation notes of information- 
extreme factor cluster analysis algorithm 

The ORS machine learning algorithm 
with optimization of acceptance tolerance for 
recognition features is represented in the form 
of a two-cycle procedure for finding the global 
maximum of information criterion [2]: 

dx = argmax{ max BO}, 
where 5 — optimal parameter of the acceptance 
tolerance field; 6;— allowed value area of para- 
meter 6 of acceptance tolerance field; {k}— time 
ordered set of machine learning steps. 

Based on the optimal geometric parame- 
ters of the containers of recognition classes ob- 
tained during machine learning, production de- 
cision rules are built in the form of: 


(VXp, © RIM) (vx © RIM) [if Gem > 
0)(Um = max{im) then x) € 
Xo, else xY) € X2], (4) 


where x4) — vector that is recognized; ,,— 
membership function of vector xY) to a recog- 
nition classX7,. 

In expression (4), the membership 
function for a hyper spherical container of 
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recognition class X7, is determined by the 
formula [3] 


— din Bx) 


=1 ; 
where x;,, dm— optimal machine learning 
para-meters: averaged binary 
implementation and hyper _ spherical 


container radius, respectively. 

When new recognition classes appear 
in the exam mode, their unclassified 
structured im-plementations form additional 
learning mat-rixes. An additional learning 
matrix, which reaches a_ representative 
volume, joins the input learning matrix and 
the ORS is retrained using an information- 
extreme algorithm. Forming additional 
learning matrixes of new recognition classes 
is carried out through an agglomerative 
algorithm of cluster analysis [5] of 
unclassified implementations. Accor-ding to 
this algorithm, the top point of the 
unclassified implementation is taken as the 
center of the recognition class X74, around 
which the area of the corresponding radius is 
set. If another classified implementation falls 
into this area, the new center of the class 
Xm+1 iS determined according to the k- 
means clustering? An unclassified implemen- 
tation does not fall into the class region, 
forms the center of a new class X7742. It con- 
tinues until a certain number of clusters are 
built. 

After that, the cluster radiuses increase 
and k-means clustering is implemented 
again. The clustering process continues until 
a representative additional learning matrix of 
a new recognition class is formed, which is 
connected to the input learning matrix and 
starts the process of ORS retraining. 

Simulation results 

According to the procedure (3), an al- 
gorithm for ORS information-extreme ma- 
chine learning in the FCA mode was deve- 
loped and implemented. As a criterion for 
optimizing the vector parameters (1), we 
used the modified Kullback criterion [2], [6]: 
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pW 22, an+10-7— [Ki +KS| 
m= 70921 Tao | 
k k 
(Kf? + Ky”)], (5) 
where eee — a number of events when 


implementations belonging to a class X7, are 
not related to it accidentally; K. OO a number of 
events when the implementations of the 
neighboring recognition class Xf? are 
accidentally related to recognition class X?,; 
10°?— a sufficiently small number to avoid 
divide by zero. 

In our example, as a source image we 
used sand pit fragment obtained from air- 
photography. As recognition classes, we have 
chosen such image sections as: class X? — dense 
forest; class X?— sand plot; class X?— road. The 
input learning matrix was formed by sequential- 
ly reading out the brightness values in pixels of 
the receptive field of each 50 X50 pixels 
frame. 

Figure 2 shows the dependency graph of 
the alphabetically averaged recognition class of 
information criterion (5) on the parameter 6 of 
the acceptance tolerance field obtained during 
the ORS machine learning. 


fo 


0 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 


Fig. 2. Dependency graph of the criterion from 
the parameter of acceptance tolerance field 


In Fig. 2 and further in the text, double 
shading denotes the working (allowed) domain 
of defining criterion function (5), where va- 
lues i and Ke are less than 0.5. Analysis 
of Fig. 4 shows that the optimal parameter value 
of the acceptance tolerance field is equal to 6 * 
= 32 of tone gradation under the criterion ma- 
ximum value E,n7g, = 4,20. 
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Based on the optimal parameters of the 
recognition class containers, the decision 
rules (4) were built and the image frames of 
the region were identified. The analysis of 
the image digitized by the numbers of 
recognition classes showed that part of the 
frames indicated by 0 was not assigned to 
any of the recognition classes. 

In order to verify the FCA algorithm, 
we input the implementations of three new 
frames: class X? (soil), class X2 (sparse 
forest) and class X2 (earthroad) in the ORS, 
functioning in the exam mode. 

After ORS retraining the learning 
matrix for six recognition classes, the 
maximum value of the alphabetically 
averaged recognition classes of the 
information criterion de-creased to 1.85, 
which is associated with an increase of 
intersection of recognition clas-ses in the 
feature space. To increase func-tional 
efficiency, the depth of machine lear-ning 
was increased by optimizing the accep-tance 
tolerance according to the scheme, in- 
volving sequential optimization of accep- 
tance tolerance for all features. In such a 
case, acceptance tolerances obtained during 
parallel optimization were taken as starting 
ones. 

Figure 3 shows a graph of changing 
the information criterion during the process 
of machine learning with sequential optimi- 
zation of acceptance tolerances. 
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Fig. 3. Graph of changing the information 
criterion during sequential optimization of 
acceptance tolerances 
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Analysis of Fig.3 shows that the 
information optimization criterion on the fourth 
iteration, the number of which is determined by 


the ratio of numbers of iterations /to the 
number of features N, has reached its maximum 
value Eynax = 4,10. In order to build decision 
rules (4), the optimal parameters of the 
recognition class containers were determined 
according to the dependence graphs of criterion 
(5) on the container radiuses were determined. 

Figure 4 shows a digitized image of 
a region obtained with decision rules (4). 


Fig. 4. Digitized image of a sand 
pit obtained according to the results 
of frame identification 


Analysis of Fig.4. shows that image 
frames with the number 0 contain exclusively 
images of objects of non-natural origin, for 
example, buildings or vehicles. The identifica- 
tion of image frames of the region allows deter- 
mine the range of interest most probably con- 
taining the ground object of non-natural origin. 

Summary 

Within the functional approach, a catego- 
rical model is proposed, based on which an 
information-extremal machine learning algo- 
rithm for ORS in the FCA mode is developed. It 
allows automatically retrain the system when 
expanding the alphabet of recognition classes. 
The use of decision rules built within the geo- 
metric approach allows increasing the effici- 
ency of making classification decisions due 
their low computational complexity. In addi- 
tion, such decision rules are almost invariant to 
the multi-dimensionality of the space of recog- 
nition features. 

The developed method for identifying 
image frames in a region allows to determine 
areas of interest which may contain ground ob- 
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jects of unnatural origin. To increase the 
functi-onal efficiency of the ORS, it is 
necessary to increase the depth of machine 
learning by optimizing additional parameters 
of machine learning. 
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